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(54) AES Encryption circuit 

(57) A round processing unit in an encryption circuit 
comprises: a first Round Key Addition circuit (204) that 
adds a round key value to input data; an intermediate 
register/Shift Row transformation circuit (206) that tem- 
porarily stores the output of the first Round Key Addition, 
circuit (204) and executes Shift Row transformation; a 
Byte Sub transformation circuit (207) Into which the val- 
ues of the intermediate registers hlft Row transforma- 
tion circuit (205) are inputted and which executes Byte 
Sub transformation; a second Round Key AdcRtion cir- 
cuit (208) Into which the values of the intermediate reg- 
ister/Shift Row transformation circuit (206) are Inputted 
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and which ados round key values; a Mix Column trans- 
formation circuit (210) that executes Mix Column trans- 
formation upon the outputs of the second Round Key 
Addition circuit (208); and a second selector (203) that 
outputs to the second Round Key Addition circuit (204) 
one of the outputs of a first selector (202), the interme- 
diate register/Shift Row transformation circuft (206), the 
Byte Sub transformation circuit (207), and the Mix Col- 
umn transformation circuit (210). Such an encryption cir- 
cuit reduces a scale of circuit and can achieve a certain 
level of high-speed processing in the Implementation of 
the AES block cipher. 
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Description 

BACKGROUND OF THE INVENTION 
s Technical Field 

[0001] Trie present Invention relates to an encryption circuit for implementing in hardware the Rijndaei algorithm, 
which is the next generation common key block encryption standard, known as the AES (advanced encryption stand- 
ard), and wlfl replace the current common.key block encryption standard In the US, called DES. 

10 

Description of Related Art 



[0002] A great variety of services are being considered that involve the Internet, Including electronic commerce and 
electronic money. These technologies are used not Just In the daily Dves of individuate, but also in a wide range of 
fields, Including transactions among corporations and Improving productivity. In particular, it is expected that encryption 
functions will be loaded onto smart cards, and mobile handsets, for the purpose of verifying the Identify of Individuals, 
and that these technologies will be widely used for authentication, digital signatures, and data encryption. 
[0003] Common key cryptography Is used In these applications to prevent third parties from tapping on the Internet. 
The current standard adopted In the US for common key cryptography Is DES; as Its replacement, the AES (advanced 
encryption standard), known as the Rijndaei aJgortthm, has been selected to be next generation common key block 
cryptography standard, and this algorithm is becoming the new standard. (The AES draft Is available at http://csrc.nist. 
qov/publtoatlons/drafts/dflps-AES.pdfV 

[0004] AES Is a block cipher for processing in block lengths of 128 bits, and the encryption algorithm, as shown in 
FIG. 1 , is thought to be executable by an encryption circuit comprising a round function unit 20 and a key schedule 
unit 10. The round function unit 20 comprises an input register 21 that temporarily stores I nput data, an XQR processing 
unit 22 that XORs the input data and expanded key segment, a round processing unit 23, a final round processing unit 
24 and an output register 25 that temporarily sto res output data. 

[0005] The round processing unit 23 comprises a Byte Sub transformation unit 31 , a Shift Row transformation unit 
32, a Mix Column transformation unit 33 and a Round Key Addition unit 34; the final round processing unit 24 performs 
the processing of the round processing unit 23 except for the Mix Column traiteforrnatlon 33; It comprises a Byte Sub 
transformation unit 35, a Shift Row transformation unit 36 and a Round Key Addition unit 37. 
[0006] Round processing Iterated; the number of rounds Nr including the final round depends on the key length 
inputted into the key schedule unft 1 0, and is defined as shown in Table 1 . 



[Table 1] 



Key Length and Number of Rqunofe 


Key Length 


Nr. 


128bit 


10 


192brt 


12 


256brt 


14 



[0007] Thus for each key length round processing Is executed NM times, and at the end the final round processing 
is executed. When the key length b128 bits, round processing is executed 9 times; when 192 bits, 11 times; and when 
256 bits, 13 times; and then in each case the final round processing Is executed. Round keys generated at the key 
schedule unit 1 0 are Inputted into the XOR processing unit 22, round processing unit 23 and final round processing 
unit 24. 

[0008] The key schedule unit 10 generates round keys.based on the key generation schedule specffled In the AES 
draft; that algorithm is shown In FIG. 2. 

[0009] The AES Proposal specftTcatJon (AES Proposal: Rijndaei, at http-7/csrc. nlst.gov/encryption/aes/rfjndael/RIJn> 
daeJ.pdf) Introduces 2 hardware Implementations for AES block cipher circuits. 

[0010] One of these is a method for hardware Implementation, in 128 bit units, of all the functions shown in FIG. 1 
as they are (hereinafter, "conventional example 1 ■). In this case, for encryption and decryption, the order of processing 
of the functions is reversed, and thus It is necessary to prepare separate processing circuits for encryption and de- 
cryption. 

[0011] Also, because, as shown in Table 1 f it is necessary to change the number of times round processing is exe~ 
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cuted depending upon the key length, ft Is necessary to create circuits for each key length. 

[001 2] Furthermore, because of the reversal of order between encryption and decryption, the order of key generation 
In the key schedule unit 1 0 forthe round keys used In the round function unit 20 has to be reversed between encryption 
and decryption. Therefore, either there has to be 2 separate key schedule units, for encryption and for decryption, or 

5 a method has to be devised for using the key schedule unit 1 0 for both encryption and decryption . 

[0013] The second method, as shown in FIG. 3, Involves creating a coprocessor 50 that has a Byte Sub transformation 
unit 51 and a Mix Column transformation unit 52, and Implementing in hardware only the Byte Sub transformation and 
the Mix Column transformation functions, and having all other functions incorporated as software into a program 41 , 
and then processing with a CPU 40 (hereinafter, "conventional example 2"). ' 

10 [0014] In this case, Byte Sub transformation and Mix Column transformation, which are unsulted for processing by 
the CPU 40 for reasons of processing time, are imp lemented in hardware as me coprocessorSO, an d the other process- 
ing is processed by the program 41 stored in the CPU, thus allowing the circuit scale to be reduced. 
[0015] If we suppose that the AES block cipher is to be incorporated into a smart card or the like, the functions 
required of an encryption circuit would be to maintain a certain level of processing speed, white keeping the scale of 

is the circuit small. With these requirements, the conventionally proposed method of Implementing all the functions in 
128-bit units results In the scale of circuit being too large, making the loading thereof onto a smart card difficult. With 
the method of Implementing In hardware only the Byte Sub transformation and the Mix Column transformation, and 
processing the other functions with software, there Is the problem of the processing speed requirements not being 
fulfilled. 

[0016] Moreover, with the key schedule unit 10 that generates the round keys, if all the round keys are stored In 
memory, a large^apacity memory Is needed, and this would make the scale of circuit large. Therefore, In order to 
reduce the scale of circuit without reducing processing speed, it is desirable to generate round keys with a circuit 
constitution that does not require storing the entire expanded key in memory. 

2S SUMMARY OF THE INVENTION 

[0017] It is an object of the present Invention to present an encryption circuit that is small in scale and that can achieve 
a certain level of processing speed when Implementing the AES block cipher. 

[0018] The present Invention provides an encryption circuit that generates from a cipher key a plurality of round keys 
30 having a number of bits corresponding to a predetermined processing block length and executing, for each processing 
block length, input data and round key encryption/decryption processing/by means of a round function unit comprising 
an XOR operation unit that XORs the input data and one of the round keys and a round processing unit that iterates 
round processing that Includes Byte Sub transformation, Shift Row transformation, Mix Column transformation and 
Round Key Addition, wherein: 

35 the round processing u nit comprises: a first selector that segments input data Into execution block lengths smaller than 
the processing block length; a first Round Key Addition circuit that adds the round key value to input data for each the 
execution block length; an intermediate register/Shift Row transformation circuit that temporarily stores the output of 
the first Round Key Addition circuit and executes Shift Row transf ormation using the processing block length; a Byte 
Sub transformation circuit wherein the intermediate register/Shift Row transformation circuit value Is inputted for each 

40 the execution block length and Byte Sub transformation Is executed; a second Round Key Addition circuit wherein the 
intermediate register/Shift Row transformation circuit value Is Inputted for each the execution block length and the 
round key value Is added for each the execution block length; a Mix Column transfonnation circuit executing Mix Column 
transformation on the output of the second Round Key Addition circuit; and a second selector that outputs to the first 
Round Key Addition circuit one output from among the outputs of the first selector, intermediate register/Shift Row 

4s transfonnation circuit, Byte Sub transformation circuit, or Mix Column transformation circuit. 

[0019] Here, the execution block length can be a multiple of a bits, the processing block length can be 128 bits and 
the execution block length can be 32 bfts. 

[0020] Further, the key length of the cipher key can be any of 1 28 bits, 1 92 bits or 256 bits. 

[0021] Also, the Byte Sub transformation circuit can comprise a matrix operation unit for decryption that executes a 
so matrix operation on input data; a third selector that outputs either the Input data or the output of the matrix operation 
unit for decryption; an Inverse operation unit for executing an inverse operation on the data outputted from the third 
selector; a matrix operation unit for encryption that executes a matrix operation on the data outputted from the Inverse 
operation unit; and a fourth selector that outputs either the output of the inverse operation unit or the output of the 
matrix operation unit for encryption . 
ss [0022] Further, the matrix operation unit for decryption and the matrix operation unit for encryption comprises an 
XOR circuit so as to perform 8-brt operations at one clock cycle and the matrix operation unit for decryption and the 
matrix operation unit for encryption comprises an XOR circuit so as to perform 1 -bit operations at one dock cycle. 
[0023] Also, the Intermediate register/Shift Row transformation circuit can be used tor both encryption and decryption 
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through the reversal of order of Input of shift data relating to amount of shift for data to be Inputted Into the intermediate 
register/Shift Row transformation circuit, the input order for decryption being the reverse of the order for encryption. 
[0024] Further, the Mix Column transformation circuit can comprise a plurality of multiplication units with unique 
multipliers and an XOR circuit that performs XOR operations for the plurality of muftiplication units, the Mix Column 

5 transformation circuit executing a matrix operation between data inputted Into each multiplication unit and the multiplier 
established for each multiplication unit. In this case, the Mix Column transformation circuit comprises 4 operation units 
having 4 multiplication units capable of 8-bit unit operations and XOR circuits that execute XOR operations based on 
theoutputsofthe4muttrpllcatton units. This multiplication units can control 2 multipliers and are used for both encryption 
and decryption and the multiplication units can be constituted to control addition values from high-order bits. 

10 [0025] Also, an encryption circuit can be constituted so as to have a key expansion schedule circuit that generates 
from the cipher key, as an expanded key segmented into bit numbers corresponding to the execution block length, a 
plurality of round keys with bit numbers corresponding to a predetermined processing block length. The key expansion 
schedule circuit comprises: 

re a fifth selector that segments a cipher key Into the number of bits corresponding to the execution block length and 

outputs the same; 

a shift register to which flip-flop circuits are connected at a plurality of stages, the flrp-flop circuits latching data m 
units of the execution block length; 

a first XOR circuit that XORs the output of the final stage flip-flop circuit of the shift register with one constant 
20 selected from among a group of constants; 

a sixth selector into which are inputted the outputs of those flip-flops of the shift register that are involved in oper- 
ations for encryption and the outputs of those flip-flops involved In operations for decryption, and which selectively 
outputs one of these; 

a Rot Byte processing circuit that rotates the output of the sixth selector; 
2« a seventh selector into whloh the output of the sixth selector and the output of the Rot Byte circuit is Inputted and 

which selectively outputs one of these; 

a Sub Byte processing circuit that executes Byte Sub transformation oh the output of the seventh selectorfor each 
the execution block length; 

an eighth selector Into which the output of the sixth selector and the output of the Sub Byte processing circuit are 
so Inputted, and which selectively outputs one of these; : 

a second XOR circuit that executes an XOR operation based on the output of the first XOR circuit. and the output 
of the eighth selector; and 

a shift register unit selector that selectively outputs, to those flip-flops of the shift register the outputs of which are 
subject to operations for encryption, either the output of the second XOR circuit or the output of the adjacent stage 
33 flip-flop. 

[0026] Here, the shift register comprises 8 flrp-flops executing data processing in 32-bit units, and the sixth selector 
is constituted so that the outputs of the second, fourth, sixth and eighth flip-flops from the bottom from among the flip- 
flops are inputted therein, and that It outputs one of these. 

40 [0027] Also, through the input into the seventh selector of the output of the Intermediate register/Shift Row transfor- 
mation circuit and the Input into the second selector of the output of the Sub Byte processing circuit, a single circuit 
can be used for the Sub Byte processing circuit and the Byte Sub transformation circuit of. the round processing unit. 
[0028] From the following detailed description in conjunction wfth the accompanying drawings, the foregoing and 
other objects, features, aspects and advantages of the present Invention will become readily apparent to those skilled 

45 m the art 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0029] 



50 



FIG. 1 1s a block diagram of AES processing using the Rijndaet algorithm; 
FIG. 2 is a key schedule program fet; 

FIG. 3 is a block diagram showing one envisioned circuit implementation; 

FIG. 4 is a block diagram of a round function unit adopted In a first embodiment of the present invention; 
FIG. 5 is a block diagram showing an intermecRate register/Shift Row transformation circuit; 
FIG. 6 Is a block diagram showing a Mix Column transformation circuit; 
FIG. 7 is a block diagram showing the constitution of a multiplication unit; 
FIG. Bis a block diagram showing another constitution of a multiplication unit; 
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FIG. 9 is a block diagram showing a key schedule unit; 
FIG. 10 is a block diagram showing a Byte Sub transformation circuit; 
FFG. 11 is a block diagram showing a matrix operation circuit for encryption; 
FIG. 12 Is a block diagram showing a matrix operation circuit for decryption; 
s FIG. 1 3 is a block diagram showing another example of a matrix operation circuit for encryption; and 

FIG. 1 4 is a block diagram showing another example of a matrix operation circuit for decryption. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
io Round Function Unit 

[0030] The AES block cipher Is an algorithm that encrypts/decrypts the 128 bit data with the 128 bit, 1 92 bit or 256 
bit key. As shown In FIG. 1 ^comprises a key schedule unit 10 that generates a plurality of round keys from the cpher 
key, and a round function unit 20 that uses the round keys Inputted from the key schedule unit 1 0 to encrypt and decrypt 

is The pund function unit 20 performs such processing as XOR operations, Byte Sub transformation processing, Shift 
Row transformation processing, Mix Column transformation processing, Round Key Addition processing. 
[0031] The first embodiment of the present Invention Is a circuit for implementation oT this round function unit 20, 
and the constitution of this circuit is shown in FIG. 4. Each circuit block executes 32-blt processing with the exception 
of Shift Row transformation processing, which is 1 28-blt processing; transfer of data between circuit blocks is executed 

20 in 32-blt units. 

[00321 This round function unit contains: an Input register 201 that temporarily stores input data; a first selector 202 
that selects 32-blt data from the 128-blt Input data; a second selector 203 Into one Input terminal of which the output 
of the first selector 202 is inputted; a first Round Key Addition circuit 204 into which the output of the second selector 
203 is inputted; an add data selector 205 that inputs Into the first Round Key Addition circuit 204 an expanded key 
segment or u O"; an Intermediate register/Shift Row transformation circuit 206 that stores the output value of the first 
Round Key Addition circuit 204 and executes Shift Row transformation in 128-bit units; a Byte Sub transformation 
circuit 207 Into which intermediate register/Shift Row transformation circuit 206 values are inputted and which executes 
Byte Sub transformation; a second Round Key Addition circuit 208 Into which intermediate register/Shift Row transfor- 
mation circuit 208 values are inputted for each 32 bits; an add data selector 20g which inputs into the second Round 
Key Addition circuit 208 an expanded key segment or "0"; and a Mix Column transformation clrcult21 0 which executes 
Mix Column transformation on the output of the second Round Key Addition circuit 208. The outputs ofthe first selector 
202, Byte Sub transformation circuit 207, Mix Column transformation circuit 210, and. Intermediate register/Shift Row 
transformation circuit 208 are inputted into the second selector 203, and one of these outputs is outputted to the first 
Round Key Addition circuit 204. 

35 

Operation Schedule during Encryption 

[0033] The operation schedule during encryption in the round function unit is shown In Table 2. 

40 
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[Table 2 J 



Round Function Operation Schedule 



5 


Round 


Cycle 


Procassing 


SEUB 




0 


000-003 


Round Key Addition 


a 






004-007 


Byte Sub Transformation 


b 


10 


1 


008 


Shift Row Transformation 


c 


15 




009-012 


Mix Column Transformation 
Round Key Addition 


c 




013-016 


Byte Sub Transformation 


b- 




2 


017 


Shift Row transformation 


c 


20 




oia-021 


Mix Column Transformation 
Round Key Addition 


c 




t 


ummaa 






25 














#1 . 


Byt» Sub Transformation 


. b 




NH 




Shift Row Transformation 


■ c • 


30 




(Nr-1)*9 - 
(Nr-1>9+3 


Mix Column Transformation 
Round Kay Addition 


c 






#2 


Byte Sub Transformation i 


b 


35 


Nr 


Nr*&-1 


Shift Row Transformation 


d 






Nr*9- 
Nr*0+3 


Round Key Addition 


d 



40 



45 



SO 



55 



#1 :(NM)*9-5 - (Nr-1*9-2 
#2:Nr*9-5-Nr*d-2 

Note: The table shows operations during encryption. 
In decryption the order of round key and Mix 
Column processings is switched 

[0034] Here, In round 0, addition of an expanded key segment Is executed by the first Round Key Addition circuit 
204 with a selector position of "a" for the second selector 203. Input data in the input register 201 is selected In 32 bit 
units by the first selector 202 and Inputted into the first Round Key AddGtion circuit 204, and to mis Is added a portion 
of a round key, Inputted from the key schedule unit, this portion being a 32-bit segment of the expanded key. While the 
input data and the expanded key are being changed into 32^bit units, the first Round Key Addition circuit 204 executes 
addition processing, and the XOR processing of the XOR unit 22 in FIG, 1 is thereby executed on 1 28-bH processing 
blocks In the 4 cydes of cycles 000 through 003. The result of the operation by the first Bound Key Addition circuft 204 
Is stored In order In 32-bit units in the intermediate register/Shift Row transformation circuit 206. 
[0035] In round 1 , the round processing 23 in FIG. 1 is executed, and Byte Sub transformation processing 31 , Shift 
Row transformation processing 32, Mix Column transformation processing 33, and Round Key Addition processing 34 
are executed. Thus, first of all, m cycles 004 through 007, with a selector position of 'b 0 for the second selector 203, 
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the data stored In the intermediate register/Shift Row transformation circuit 208, while being shifted in 32-blt un&s, Is 
read out and Inputted Into the Byte Sub transformation circuit 207. At this time, by making the data to be selected by 
the add data selector 205 M 0\ the first Round Key Addition circuit 204 is put into a masked state. The resuit of the 
operations of Byte Sub transformation circuit 207 Is stored In order in 32-bit units In the Intermediate register/Shift Row 
5 transformation circuit 206. Thus Byte Sub transformation processing performs on 128 bits, and the result is stored in 
the Intermediate register/Shift Row transformation circuit 206. 

[0036] Next, in cycle 008, Shift Row transformation processing Is executed. The intermediate register/Shift Row 
transformation circuit 206 is capable of executing Shift Row transformation processing In 128-bit units, and in this cycle 
008, 128-bft Shift Row transformation processing is executed. At this time, the selector position of the second selector 
10 203 may be any position, but in consideration of the processing in the next cycle, a position of V is preferable. 

[0037] In cycles 009 through 0012, Mix Column transformation processing and Round Key Addition processing are 
executed. Herein, the data stored In the intermediate register/Shift Row transformation clrcutt 206, while being shifted 
In 32-blt units, is read out and inputted Into the second Round Key Addition circuit 208. At this time, by making the data 
to be selected by the add data selector 209 "0', the second Round Key Addition circuit 208 is put into a masked state. 
By setting the selector position of the second selector 203 at "C\ the data upon which Mix Column transformation 
processing has been executed at the Mix Column transf oimation circuit 21 0 is Inputted into the first Round Key Addition 
circuit 204 via the second selector 203. An expanded key segment to be Inputted from the key schedule unit is selected 
for data to be selected by the add data selector 205, and this data undergoes Round Key Addition processing at the 
first Round Key Addition circuit 204. The result of the Mix Column transformation processing at the Mix Column trans- 
formation circuit 210 and the Round Key Addition processing at the first Round Key Addition circuit 204 are, while 
being each shifted in 32-bit units, stored in the intermediate register/Shift Row transformation circuit 206. Thus, the 
result of the 1 28 bits upon which Mix Column transformation processing and the Round Key Addition processing were 
executed in cycles 009 through 01 2 are stored in the intermediate register/Shift Row transformation circuit 206. In this 
manner, one round of processing is executed in the 9 cycles of cycles 004 through 012. 
** [003q Next, in rounds 2 through (NM ), the same processing as In round 1 is executed (however, Nr is the number 
of processing rounds Including the final round, and as shown In Table 1 , the number of rounds will differ according to 
key length). 

[0039] In round Nr (the final round), the final round processing 24 of FIG. 1 is executed; this comprises Byte Sub 
transformation processing 35, Shift Row transformation processing 38 and Round Key Addition processing 37. 
so [0040] Thus In cycles (Nr*9-5) through (Nr*9-2), with the selector position of the second selector 203 at "b", data 
stored in the intermediate register/Shift Row transformation circuit 206, while being shifted In 32-bit units, is read out 
and inputted into the Byte Sub transformation circuit 207. At this time, by making the data to be selected by the add 
data selector 205 M fT , the first Round Key Addition circuit 204 is put into a masked state; The resuit of the operation 
of the Byte Sub transformation circuit 207 is stored In order in 32-blt units in the intermediate register/Shift Row trans- 
formation circuit 206. Thus Byte Sub transformation processing of 128 bits is performed, and the result Is stored in the 
intermediate register/Shift Row trarisformation circuit 206. 

[0041] Next, in the (Nr*9-1) cycle, 128-blt Shift Row processing is executed At this time, the selection position of 
the second selector 203 may be any position, but in consideration of the processing of the next cycle, a position of "d* 
is preferable; 

[0042] In the (Nr*9) through (Nr*9+3) cycles, Round Key Addition processing is executed. Specifically by making 
the selector position of th e seco nd selector 203 ?d", the data stored in the Intermediate register/Shift Row transformation 
circuit 206, while being shifted In 32-bit units, is read out and inputted Into the first Round Key Addition circuit 204 via 
the second selector 203. At this time, by making data to be selected by the add data selector 205 ah expanded key 
segment to be Inputted from the key schedule unit, the first Round Key Addition circuit 204 adds 32-bit round keys. 
The result of the Roun d Key Addition processing by the first Round Key Addition circuit 204 Is stored in the inteimedlate 
register/Shift Row transformation circuit 206 while being shifted in 32-blt units. Tnus In the (Nr*9) through (Nr*9+3) 
cycles, the resuit of the Round Key Addition processing on the 128 bits is stored In the intermediate register/Shift Row 
transformation circuit 208. In this manner, in the 9 cycles from (Nr*9-5> through (Nr*9+3), final round processing Is 
executed. 



35 



40 



45 
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Operation Schedule during Decryption 

[0043] Operations during decryption in this round function unit are performed In the reverse order to operations during 
encryption. This operation schedule Is shown in Table 3. 



7 

PAGE 11/135 * RCVD AT 6/5/2006 12:47:54 AM [Eastern Daylight Time] • SVR:USPTO-EFXRF-2/0 ' DNlS:273S300 * CS ID: 66 1-460-1 986 



• DURATION <mm-ss):*W2 



6/4/2006 10:48 PM FROM: 661-460-1986 Huffman Patent Group, LLC TO: 1-571-273-8300 PAGE: 012 OF 135 

EP 1 271 839 A2 

TTable 3 J 

Round Function Operation Schedule 



Round 


Cycle 


Processing 


SELJ3 


0 


000-003 


Round Kev Addition 


Q 




004 


Shift. Row Trsrisfnrmft+irm 


D 




005-008 


Byte Sub Transformation 


b 


i 


009-012 


Round Kov AdrftJnn 
jyiix i^oiurnn i rans^nnaTjon 






I 013 


Shift Row TVancfnrmnt'inn 


b 




014-017 


Byta Sub Transformation 


b 


2 


018-021 


Round Key Addition 

Mix Oojurpn. TransfQnrwtipn 


c 




Omitted 








<Nr-1)*9-5 


Shift Row Transformation 


b 




#1 


Byte Sub Transformation 


b 


Nr-1 


<Nr-1)*9 - 


Round Key Adcftion ' 
J/lix Column Transformation 


c 




Nr*9-5 


Shift Row Transformation 


b 




. #2 


Byte Sub Transformation 


b 




Nr*9- 
Nr*9+3 


Round Key Addition 


d 



#1:(Nr-1>^-(Nr-1)*9-! 
#2:Nr*9-*-Nr*9-1 

[0044] In round 0, with the selector position of the second selector 203 at "a", the first Round Key Addition circuit 

204 adds expanded key segments. Input data In the input register 201 is selected in 32-blt units by the first selector 
202 and Inputted into the first Round Key Addition circuit 204, and from the round key to be inputted from the key 
schedule unit, a 32-blt expanded key segment is added. At this time, data to be Inputted via the first selector 202 is 
inputted In an order that fe the reverse of the input order for encryption, and the input order of the expanded key 
segments to be inputted from the key schedule input is also the reverse of the input order for encryption. In this manner, 
as the Input data and expanded key are changed every 32 bits, the first Round Key Addition circuit 204 executes add 
processing, thereby allowing execution of Round Key Addition processing on a 128-bit processing block in cycles 000 
through 003. The result of the operations of the first Round Key Addition circuit 204 is stored in 32-bit units In the 
intermediate register/Shift Row transformation circuit 206. 

[0045] In round T, processing is performed in the order of Shift Row transformation. Byte Sub transformation, Round 
Key Addition, and Mix Column transformation. For this reason, first, In cycle 004, in the intermediate register/Shift Row 
transformation circuit 206, Shfft Row transformation processing is executed In 1 28-bit units. In this case the processing 
is the same as the Shift Row transformation processing during encryption. Also, the selector position of the second 
selector203 may be any position, but in consideration of the processing in the next cycle, a position of "b" is preferable. 
[0046] Next, in cycles 005 through 008, with a selector position of V for the second selector 203, data stored In the 
Intermediate register/Shift Row transformation circuit 206, while being shitted in 32-bit unite, is read out and Inputted 
into the Byte Sub transformation circuit 207. At this time, by making the data to be selected by the add data selector 

205 V, the first Round Key Addition circuit 204 is put Into a masked state. The result of the operation by the Byte Sub 
transformation circuit 207 is stored in order in the intermediate register/Shift Row transformation circuft 206 in 32-bit 
units. In this case, the Byte Sub transformation processing is executed so as to be the inverse of the transformation 
processing during encryption; this will be discussed below. In this manner, Byte Sub transformation processing is 
performed on 128 bits, and the result is stored in the intermediate register/Shift Row transformation circuit 208. 
[0047] In cycles 000 through 01 2, Round Key Addition processing and Mix Column transformation processing are 
executed. Here, data stored sn the intermediate register/Shift Row transformation circuit 206, while being shifted In 
32-bit units, is read out and inputted into the second Round Key Adoption circuit 208. At this time, data selected by the 
add data selector 209 is made the expanded key segment Inputted from the key schedule unit. Also, with the selector 
position of the second selector 203 at "c - , the output of the Mix Column transf ormation circuit 21 0 is inputted into the 
first Round Key Addition circuit 204 via the second selector 203. At this time, by making the data to be selected by the 
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add data selector 205 "O", the first Round Key Addition circuit 204 is put into a masked state. In this case, Mix Column 
transformation processing Is executed In such a manner as to be transformation processing that Is the inverse of the 
transformation processing during encryption; this will be explained in detail below. Thus the 128-bit resultant of the 
Round Key Addition processing by the second Round Key Addition circuit 208 and of the Mix Column transformation 
s processing by the Mix Column transformation circuit 21 0 is stored In the intermediate register/Shift Row transformation 
circuit 206. In this manner, one round of processing is executed In the 9 cycles of cycle 004 through 012. 
[0048] Next, in rounds 2 through (Nr-1 ), the same processing as in round 1 is executed (however, Nr Is the number 
of rounds including the final round, and as shown in Table 1 , different numbers of rounds are stipulated depending on 
key length). 

10 [0049] In round Nr (the final found), Shift Row transformation processing, Byte Sab transformation processing and 
Round Key Addition processing are executed. 

[0050] For this reason in cycle (Nr*9-5), 128-bit Shift Row transformation processing is executed. At this time, the 
selector position of the second selector 203 may be any position, but in consideration of the processing of the next 
cycle, a position of "b" is preferable. 

*5 [0051] Next, In cycles (Nr*M) through (Nr9-1), with the selector position of the second selector 203 at "b", data 
stored in the intermediate register/Shift Row transformation circuit 206, while being shifted in 32-bit units, is read out 
and Inputted into the Byte Sub transformation circuit 207. At this time, by making the data to be selected by the 205 
"0 K , the first Round Key Addition circuit 204 Is put into a masked state. Result of the operation by the Byte Sub trans- 
formation circuit 207 is stored In order In the intermediate regteter/ShBt Row transformation circuit 206 In 32-bit units. 

20 Tf ws Byte Sub transformation processing is conducted on 1 28 bits, and the result is stored in the intermediate register/ 
Shift Row transformation circuit 206. 

£0052] In cycles (Nr*9) through (Nr*9+3), Round Key Addition processing is executed. Here, by making the selector 
position of the second selector 203 "d', data stored in the intermediate register/Shift Row transformation circuit 206, 
while being shifted in 32-bit units, Is read out and inputted into the first Round Key Addition circuit 204 via the second 
25 selector 203. At this time, by making the data to be selected by the add data selector 205 an expanded key segment 
Inputted from the key schedule unit, 32-bit Round Key Addition processing by the first Round Key Addition circuit 204 
can be executed. The result of the Round Key Addition processing in the first Round Key Addition circuit 204 Is, while 
being shifted in 32-bit units, stored in the Intermediate register/Shift Row transformation circuit 206. Thus In cycles 
(Nr"9) through (Nr*3*3), the 1 28-bit result of Round Key Addition processing is stored in the intermediate register/Shift 
so Row transformation circuit 206. In this manner, the final round processing. is executed in the 9 cycles from cycles 
(Nr*9-5) through (Nr*9+3). Intermediate Value Register/Shift Row Transformation Circuit 
[0053] FIG. 5 shows one embodiment of the Intermediate value register/Shift Row transformation circuit 
[0054] In this constitution, 4 shift registers that process in 8-blt units are provided. The first shift register has 4 flip- 
flops, flip-flops 302, 304, 306 and 308, connected In series, and to each of the flip-flops 302, 304, 306, and 308 selectors 
301 , 303, 305, and 307, which select Inputs, are respectively connected. Input data INO and the output of the flip-flop 
302 are Inputted into the first selector 301 , and either one of these is inputted into the flip-flop 302. Similarly, Into the 
second through fourth selectors 303, 305 and 307, the outputs of the previous-stage flip-flops 302, 304, and 306, as 
weO as the outputs of the flip-flops 304, 306, and 308 are Inputted, and one of these is inputted into the flip-flops 304, 
308 and 308, respectively. 

[0055] The second shift register has 4 flip-flops, flip-flops 312, 314, 316 and 318 connected In series; and to each 
of the fDp-flops 31 2, 31 4, 31 6 and 31 8, selectors 31 1 , 31 3, 31 5, and 31 7, which select Input, are respectively connected, 
input data IN1 and the outputs of the flip-flop 312 and IhefBp-flop 318 are inputted into the first selector 311 ,"and one 
of these is Inputted Into the flip-nop 312. Similarly, Into the second through fourth selectors 313, 315 and 317, the 
outputs of the previous-stage flip-flops 312, 314, and 31 6, as well as the outputs of the flip-flops 31 4, 31 6, and 318 
are inputted, and one of these is Inputted into the flip-flops 31 4, 316 and 318, respectively. 

[0056] The third shift register has 4 flip-flops, flip-flops 322, 324, 326 and 328 connected In series; and to each of 
the flip-flops 322, 324, 326 and 328, selectors 321 , 323, 325, and 327, whfch select input, are respectively connected. 
Input data IN2 and the outputs of the f Hp-flop 322 and the flip-flop 326 are inputted Into the first selector 321 , and one 
of these Is Inputted into the flip-flop 322. Similarly, into the second selector 323, the output of the respective previous- 

so stage fllp^op 322, the output of the flip-flop 324, and the output of the flip-flop 328 ere inputted; and one of these is 
Inputted Into the flip-flop324. Into the third selector 325, the output of the previous stage flip-flop 324, the output of the 
flip-flop 326, and the output of the flip-flop 322 are Inputted, and one of these is inputted into the flip-flop 326. Into the 
fourth selector 327, the output of the previous stage flip-flop 326, the output of the flip-flop 328 and the output of the 
flip-flop 324 are Inputted, and one of these is inputted into the flip-flop 328. 

55 [0057] The fourth shift register has 4 flip-flops, flip-flops 332, 334, 336 and 338 connected in series; and to each of 
the flip-flops 332, 334, 336 and 33 B, selectors 331 , 333, 335, and 337, which select Input, are respectively connected. 
Input data IN3 and the outputs of the flip-flop 332 and the flip-flop 334 are inputted into the first selector 331 , and one 
of these is inputted Into the flip-flop 332. Similarly, Into the second selector 333, the output of the previous-stage flip- 
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13 



flop 332, the output of the flip-flop 334, and the output of the flip-flop 336 are Inputted, and one of these is inputted into 
the flip-flop334. Into the third selector 335, the output of the previous 6tage flip-flop 334, the output of the flip-flop 336, 
and the output of the flip-flop 338 are inputted, and one of these is inputted Into the flip-flop 336. Into the fourth selector 
337, the output of the previous stage flip-flop 336, the output of the flip-flop 338, and the output of the flip-flop 332 are 
inputted, and one of these is inputted into the flip-flop 338. 

[0058] . When an Intermediate value register/Shift Row transformation circuit thus constituted is operated as an in- 
termediate value register for the various processing stages, by inputting data Into Input data IN0 through IN3 in 8-blt 
units : data processed In each cycle In 32-bit units can be stored. Furthermore, by making the selector positions of the 
selectors 301 through 337 "b", and, while shifting the data in flip-flops to the next stage, Inputting data In 8-bit units 
into input data IN0 through IN3 respectively, 128 bits of data can be inputted in 4 cycles. When the Input of 1 28 bits of 
data has been completed, the 4 8-blt data inputted in the first cycle are latched in the flip-flops 308, 31 8, 328, and 338 , 
respectively. 

[0059] An explanation will now be given of the operations of the Shift Row transformation. 
[0060] In the Rijndaei algorithm, input data is segmented into 8-blt data segments aOO through a33 and these are 
processed as a matrix; thedlrection of the shift for decryption is the reverse ofthe direction for encryption. In the present 
invention, the order In which data Is processed is the order of the column array; by processing in reverse order for 
encryption and for decryption, Shift Row transformation can be achieved using the same processing. 



20 



[Table 4] 



Data Array and Processing Order 



25 



Column^ 



Row 





aOI 


a02 


a03 




a11 


al2 


a13 




a21 


a22 


a23 




a31 


a32 


a33 



Encryption 



Row 



Column 



A 


aOO 


aOI 


a02 






a10 


all 


a12 






320 


a21 


a22 


n 


T 


a30 


a31 


a32 





Decryption 



[0061] As shown on Table 4 left, when.the data in rows is arranged in order starting from the column to the far left, 
for Oncryption, processing is executed starting from the column to the far left. For decryption, as seen In Table 4 right, 
processing is executed starting from the column to the far right. 

[0062] In Shift Row transformation processing for encryption, the rows of a data array arranged as oh Table 4 left 
are cyclically shifted different byte-lengths. Specif ically, as shown in Tables, the first row is not shifted, row 2 is cyclically 
shifted one byte to the left, row 3 Is cyclically shifted 2 bytes to the left, and row 4 fe. cycfically shifted 3 bytes to the . 
left. Thlscauses the pre-processing state, shown in Table 5 left, to become the post-processing state shown In Table 
5 right. 
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[Table 5] 
[Encryption] 

Pre-processing Post-processing 



a00 


aOI 


a02 


a 03 




aOO. 


a01 


a02 


a03 


a10 


a11 


a12 


a13 


Cyclic Shift 1 Byte Left . 


all 


a12 


at 3 


a10 


a20 


a21 


a22 


a23 


Cyclic Shift 2 Bytes Left 


a22 


a23 


a20 


a21 


a30 


a31 


a32 


e33 


Cyclic Shift 3 Bytes Left 


□33 


a30 


a31 


a32 
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[0063] For decryption, so as to achieve the inverse of the processing during encryption, the rows of e data array 
arranged as on Table 4. left are cyclically shifted different byte-lengths. Specifically, as shown In Table 5, the first row 
is not shifted, row 2 Is cyclically shifted 3 bytes to the left, row 3 is cyclically shifted 2 bytes to the left, and row 4 Is 
cyclically shifted 1 byte to the left. This causes the pre-processing state, 6hown in Table 6 left, to become the post- 
processing state shown in Table 6 right. 
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[Table 6} 

[ Decryption 3 

Pre-processing 



Post-processing 



oOO 


a01 


a02 


a03 




aOO 


a01 


a02 


a03 


a10 


all 


a12 


a13 


Cyclic Shift 3 Bytes Left 


a13 


a10 


all 


a12 


a20 


a21 


a 22 


a23 


CycBc Shift 2 Bytes Left 


a22 


a23 


a20 


a2t 




a31 


a32 


a33 


Cyclic Shift 1 Byte Left 


a31 


a32 


a33 


a30 



[00G4] In the present embodiment, the Intermediate value register/Shift Row transformation circuit shown In FIG. 5 
Is used. Thus, at the stage when the input of 1 28 bits of data has been completed, the data that was inputted in the 
initial cycle is latched In the final stage flip-flops 308, 318, 328, and 338, and data Is latched In order in the previous 
£5 stage flip-flops. When data Is to be outputted, as tt is being shifted 1 byte to the right at one cycle, data te outputted 
from the final stage flip-flops at the far right Therefore when data is rearranged in consideration of the fact that the 
data processing order starts from the far right, the state before Shift Row processing for encryption takes the form 
shown In Table 7 left. 
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35 
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[Table 7] 

[ Encryption ] 



Post-processing 



a03 


e02 


a01 


aOO 




a03 


a02 


aOI 


aOO 


a!3 


a12 


all 


alO 


Cyclic Shift 1 Byte Right 


a10 


a13 


a12 


all 


a23 


a22 


a21 


a20 


Cyclic Shift 2 Bytes Right 


a21 


a20 


a23 


a22 


a33 


a32 


a31 


a30 


Cyclic Shift 3 Bytes Right 


a32 


a31 


a 30 


a33 



45 



50 



[0OS5] To perform the same cyclic shift as In Table 5, as shown In Table 7 right, the first row is not shifted, the second 
row is cyclically shifted 1 byte to the right, the third row is cyclically shifted 2 bytes to the right, and the fourth row Is 
cyclically shifted 3 bytes to the right 

[0068] In order to perform this kind of Shift Row transformation processing for encryption, the intermediate value 
register/Shift Row transformation circuit shown In FIG. 5 is used to switch and control the selectors, and to replace 
data at once, in 128-bit units. 

[0067] For the first row, because a shift is unnecessary, the selector positions of the selectors 301 , 303, 305 and 307 
are set at °a\ For the second row, because of the cycBc shift 1 byte to the right, the selector position of theselector 
311 Is set at V, and the other selectors 313, 315, and 317 are set at selector position n b n . For the third row, because 
of the cyclic shift 2 bytes to the right, the selector position of the selectors 321 , 323, 325 and 327 is set at "c". For the 
fourth row, because of the cyclic shift 3 bytes to the right, the selector position of the selectors 331 , 333, 335 and 337 
is set at 

I006S] By designating the output data being latched by the flip-flops In the Intermediate value register/Shift Row 
transformation circuit prior to execution of the above-described Shift Row transformation processing as bOO through 
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b33 respectively, as shown in FIG. 5 the output data becomes latched to the output of the flip-flops In an array as shown 
In Table 8 right. 



[Table 8] 



Shift Row Transformation Operation Model 



10 



15 



Prior to Shift Row 



Subsequent to Shift Row 



b03 


b02 


b01 


bOO 




b03 


b02 


bOI 


bOO 


b13 , 


b12 


b11 


blO 


b10 


b13 


b12 


b11 


b23 


b22 


b21 


b20 


4 


b21 


b20 


b23 


b22 


b33 


b32 


b31 


b30 


b32 


b31 


b30 


b33 



20 



[0069] For deciyptlon, because processing Is executed from the right column as In Table 4, the data is arrayed as 
shown In Table 9 left 



30 



[Table 9] 

[ Decryption ] 



a 00 


a01 


a02 


a03 


alO 


a11 


a12 


a13 


a 20 


a21 


a22 


a23 


a30 


e31 


a32 


a33 



Cyclic Shift 1 Byte Right 

Cycfic Shift 2 Bytes Right 
Cyclic Shift 3 Bytes Right 



Post-processing 



aOO 


aOI 


a02 


a03 


a13 


alO 


all 


a12 


a22 


a23 


a20 


a21 


a31 


a32 


a33 


a30 



40 



43 



60 



[0070] To perform trie same cycite shift as In Table 6, as shown In Table 9 right, the first row Is not shifted, the second 
row Is cyclically shifted 1 byte to the right, the third row Is cyclically shifted 2 bytes to the right, and the fourth row is 
cyclically shifted 3 bytes to the right 

[0071] Therefore, as during the above-described Shift Row transformation for encryption, by setting the selector 
values of the selectors In the intermediate value register/Shift Row transformation circuit and performing exactly the 
same processing as the cycfic shift for. encryption as shown in Table 8, Shift Row transformation processing for de- 
cryption can be executed. 

[0072] In this way, the same Intermediate value regteter/Shfft Row transformation circuit can be used for Shift Row 
transformation processing for both encryption andTdecryption. Mix Column Transformation Circuit 
[0073] The Mix Column transformation circuit adopted In this. embodiment is shown In FIG. 6. 
[0074] This Mix Column transformation circuit Includes 4 operation units, a first operation unit 351 , a second operation 
unit 352, a third operation un& 353 and a fourth operation unit 354. The first operation unit 351 comprises afirst mul- 
tiplication unit 381 , a second multiplication unit 362, a third multiplication unit 363, and a fourth multiplication unit 364, 
each of which executes operations in 8-bit units, and an XOR circuit 365 that XORs the outputs of the multiplication 
units 361 through 364. The second operation unit 352, third operation unit 353, and the fourth operation unit 354, which 
are not shown In the figure, also have a first through fourth multiplication unit and an XOR circuit 
[0075] When a column J comprising (aOj, a1j t a2J, a3J) Is transformed Into a column comprising (bOJ, b1j, b2j, b3D, 
the data (bOj, b1j t b2j, b3j) of column J after transformation can be expressed as follows. 
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Encryption 
[0076] 

5 

bOJ = 02*a0j + 03*af J + ora2j + 01 *a3] 
b1J = 01 *a0J + 02*a1j + 03*a2j + 01 *a3j 

w 

b2j = 01 *a0j + 01*a1 j + 02*a2J + 03*a3J 



i 5 b3J = 03*aOj + Ora1J + 01*a2J + 02*a3j . 

Decryption 
10077] 

20 

bOj = 0E*a0J + 0B*a1 j + 0D*a2j + 09*a3] 



b1J = 09*a0j + 0E*a1J + 0B*a2J + 0D*a3j 



b2J = 0D*a0j + 09*a1 j + 0E*a3 + 08*a3j 

30 

b3j = 0B*aOj + 0D*a1j +. 09*ag + 0E*a3J 

(0076] . The coefficients by which each column is multiplied are described as hexadecimal. 
[0079] To execute this Mix Column transformation processing, the 32-blt data columns are Inputted Into the first 
as through fourth operation units 351 through 354, respectively, and multiplication by the first through fourth operation 
units 361 through 384 and the operation by the XOR circuit are performed. . 

[0080] The multiplication units 36T through 364 of the operation units 351 through 361 are provided with a coefficient 
for encryption and a coefficient for decryption, so mat they can be used for both encryption and decryption, and they 
are constituted so that selection of a coefficient can be made during operations. 
40 [0081] The first multiplication unit 361 of the operation unit 351 can multiply Inputted data by either 0x02 or OxOE. 
The second multiplication unit 362 can multiply inputted data by either 0x03 or OxOB. The third multiplication unit 363 
can multiply Inputted data by either OxOi or OxOD. The fourth multiplication unit 364 can multiply inputted data by either 
0X01 pr0x09. 

[0082] The first multiplication unit of the second operation unit 352 can multiply Inputted data by either 0x01 or 0x09. 

45 The second rnultip licatJon unit can multiply Inputted data by either 0x02 or OxOE. The third multiplication u nit can multiply 
inputted data by efther 0x03 or OxOB. The fourth multiplication unit can multiply inputted data by either 0x01 or OxOD. 
[0083] The first multiplication unit of the third operation unit 353 can multiply Inputted data by either 6x01 or OxOD. 
The second multiplication unit can multiply inputted data by either 0x01 or 0x09. The third multiplication unit can multiply 
inputted data by either 0x02 or OxOE. The fourth multiplication unit can multiply Inputted deia by efther 0x03 or OxOB. 

so [0084] The first multiplication unit of the fourth operation unit 364 can multiply inputted data by either 0x03 or OxOB. 
The second multiplication unft can multiply inputted data by either 0x01 or OxOD.The third multiplication unit can multiply 
inputted data by either 0x01 or 0x09. The fourth multiplication unit can multiply Inputted data by either 0x02 or OxOE. 
[0085] By changing the coefficients used for encryption and for decryption In the first through fourth multiplication 
unite of the first through fourth operation units 351 through 354, the same circuit constitution can be shared for both 

55 encryption and decryption. Multiplication Units of the Mix Column Transformation Circuit 

[0086] An example of the multiplication units included in the Mix Column transformation circuit is shown in FIG. 7. 
[0087] The multiplication units multiply Inputted 6-bit data (a7. a6, a5, a4, a3, a2, a1 , aO) with a coefficient (b3, b2, 
bl, bO). For this, partial product operation units 375 through 378 are provided, which multiply the 8-bit data (a7, a6, 
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a5. a4, a3, a2, a1 , aO) with each bit of a coefficient <b3, b2, b1 , bO). Also provided are: an addition unit 371 that shifts 
the result of the partial product unit 376 1 bit and adds this to the result of the partial product unit 375, which rnuttplies 
using the highest bit of a coefficient; an addition unit 372 that shifts the resultant of the partial product unit 377 1 bit 
moreover and adds this; and an addition unit 373 that shifts the resultant of the partial product unit 37B 1 bit moreover 
5 and adds this* There Is also provided a division unit 374 Into which the resultant of the addition unit 373 and overflow 
carried over from the addition units 371 to 373 are inputted and divided by a divisor. 

[0088] With this constitution, by selectively setting as the coefficient (b3, b2, b1 , bO) a coefficient for encryption and 
a coefficient for decryption, the mixed column transformation processing can be used both for encryption and for de- 
cryption. 

10 [0089] As described above, there are 2 coefficients, set as (b3, b2, b1 , bO), established for each multiplication unit. 
There are 4 combinations of coefficients in the multiplication units, namely, (0x02, OxOE), (0x03, OxOB), (0x01 , OxOD), 
(0x01, 0x09). When these are expressed as 4 low order bits, they become (0010, 1110), (0011, 1011), (0001 , 1101), 
and (0001 , 1 001). The operations for common bits in these coefficients do not perform control of the partial products; 
rather, the operations for different bits control the addition processing; this allows the circuit to be reduced In scale. 

is [0090] For example, when the coefficients are the combination (0x01 , OxOD), they become (0001 ,1101) when ex- 
pressed in binary; by controlling whether or not the result of the addition of the partial product of the 2 upper bits is 
added to the partial product of the iower2 bits, the selection and multiplication of 2 coefficients becomes possible. FIG. 
8 shows the circuit constitution for the coefflctent combination (0x01 , OxOD). 

[0091] In FIG. 8, a first addition unit 381 that shifts inputted 8-bit data (a7, a6, a5, a4, a3, a2, a1 , aO) 1 bit and executes 
20 addition processing thereupon. The output of the first addition unit 381 is inputted into a second addition unit 383 via 
a control logic circuit 382. This second addition unit 383 adds the result of the partial product operation by the uppermost 
bit of the coefficient, and it is constituted to shift inputted 8-bit data 3 bits and execute addition processing thereupon. 
[0092] A division un it 384 is provided Into which the resultant of the operation of the addition unit 383 and the overflow 
carried over from th e first addition unit 381 and the second addition unit 383 are inputted and divided by a divisor. 
25 [0093] The control logic circuit 382, when a coefficient is 0x01 , does not output the output of the addition unit 381 , 
. which Is an upper 2-bft resultant. The control logic circuit 382 may be constituted so that, when a coefficient is OxOD, 
the output of the first addition unit 381 , which is an upper 2 bit result, Is outbutted to the addition unit 383 
[0094] Because the multiplication performed here is multipficatlon over GF (2*) where the irreducible polynomial is 
M(x) e x* -i- x* + x* + x +1 , and the addition is over GF{2), they can be achieved with an XOR operation. 
30 [0095] In this manner, by controlling theaddttlon of partial products in different bits of 2 coefficients, the circuit scale 
can be made smaller, enabling reduction of the scale of circuit. Key Schedule Unit 
[0098] FIG. 9 shows the circuit constitution of the key schedule unit. 

[0097] The key schedule unit comprises, primarily, an expanded key generation logic unit 101 , an expanded key. 
register 120 and a key input register 131 . 
as [0098] The key input register 131 is a 256-blt register comprising 8 32-bit registers kO through k7, and a cipher key - 
is stored in 32-blt units starting from register kO and proceeding in order therefrom. When the cipher key is 256 bits, 
data is stored in all the registers kO through k7; when the cipher key is 1 92 bfts, data is stored in registers kO through 
k5, and when the cipher key is 1 28 bits, data is stored In registers kO- through k3. 

[0099] A selector 132 that selectively, outputs one value from the registers kO through k7 is connected to the key 
40 input register 1 31 . This selector 1 32 selects 32 bits of data from the 256-bit data of the key Input register 1 31 and 
inputs this at the lowest position of the expanded key register 1 20. 

[0100] the expanded key register 120 is a shift register to which are connected In series 8 flip-flops 121 through 
128, which are capable of processing in 32-bit units. Inputted into the flip-flop 128, which Is at the lowest position, is 
the output of the selector 1 1 3, which selects the output of the selector 1 32 and the output of the expanded key generation 
43 joglc unit 1 01 . The output W7Key of the flip-flop 128 Is bipuned Into the flip-flop 1 27. The output W6Key of the flip-flop 
127 is inputted Into the selector 112, which Is at the stage previous to the fip-f lop 126. Inputted into the selector 112 
is the output W6KEY of the flip-flop 1 27 and the output of the expanded key generation logic unit 1 01 , and one of these 
Is Inputted into the flip-flop 126. 

[Old] the output W5KEY of the flip-flop 126 is Inputted Into the flip-flop 125, The output W4Key of the flip-flop 125 
so is inputted into the selector 111 , which Is at the stage previous to the flip-flop 124. Inputted Into the selector 11ils the 
output W4KEY of the flip-flop 125 and the output of the expanded key generation logic unit 101 , and one of these is 
inputted into the flip-flop 124. 

[0102] The output W3KEY of the flip-flop 1 24 is inputted Into the flip-flop 123. The output W2KEY of the flip-flop 123 
Is inputted into the flip-flop 1 22. The output W1 KEY of the fiip-flop 1 22 Is Inputted into th e flip-flop 121. 
53 [0103] The expanded key generation logic unit 101 Includes a ROM 102 in which an expanded key generation con- 
stant Rcon is stored, an AND circuit 103 that ANDs a value read out from the ROM 1 02 and a signal RCOW_EN, and 
an XOR clrcuft 1 04 which XORs the WOKEY of the flip-flop 1 21 positioned at the top of the expanded key register 1 20 
and the output of the AMD circuit 1 03. which have been inputted therein. 
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10104] The expanded key generation logic unit 101 also includes a selector 1 05, Into which the flip-flop 122 output 
W1 KEY. the flip-flop 124 output W3KEY, the flip-flop 126 output W5KEY, and the flip-flop 128 output W7KEY are In- 
putted, and which selectively outputs one of these. The output of the selector 105 is inputted into the Rot Byte circuit 
1 06, which rotates data, the selector 107, and selector 1 09. The output of the Rot Byte circuit 1 06 and the output of 

s the selector 1 05 are inputted Into the selector 1 07, which suppDes one of these to the Sub Byte circuit 1 08. The Sub 
Byte circuit 10B executes Byte Sub transformation processing in 32-bit portions, and supplies the output thereof to the 
selector 1 09. Into the selector 1 09 are Inputted the output of the Sub Byte circuit 1 08 and the output of the selector t 
1 06. one of which It outputs. The expanded key generation logic unit 1 01 also includes an XOR clrcuft 110. The output 
of trie XOR clrcuft 1 04 and the output of the selector 1 09 are inputted into the XOR circuit 110. which then XORs these 

10 outputs. 

[0105] A key schedule unit thus constituted includes such functions as: 1) generation of the expanded key used in 
the Round Key Addition processing of the round function unit; 2) rewrite of the key input register during encryption, 
and setup of the expanded key Initial value following completion of encryption and decryption; and 3} setup of expanded 
key Initial value following rewrite of the key input register during 

« [01O6J The round keys used In Round Key Addition processing of the round function unit must total 15, from the 
Initial round key and round key 01 through round key 14, when the key length is 256 bits. Each round key is made up 
of 1 2B bits, In correspondence with the processing block length; In order to assign the round keys to the 32-blt expanded 
key segments generated by the key schedule unit, a total of 60 expanded key segments WOO through W59 are required. 
These expanded key segments WOO through W59 are used in the order W00.->W59 for encryption, and in the order 

so W59->W00 for decryption. In this embodiment, as shown in Table 10, expanded key segments are generated in the 
order W00->W59 for encryption, arid in the order W59-»VV00 during decryption. 
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[Table 10 ] Expansion Key Schedule (This Example for 256-Bit Key Length) 



10 



15 



20 



25 



30 



35 



40 



No. 


Encryption 


Decryption 


00 


YTOQ=tkO) 


W59 * 


01 




WS8 


02 


W02=0O) 


W37 


03 


Vtt3=(k3) 


W56 


04" 


W04=0(4) 


W55 


OS 


W05=(k5) 


W54 


06 


wo*=<ke) 


W53 


07 


W07=fr7> 


W52 


08 


W08=W0O*Sub ByteCRat Byte(W>7»-Rc<m£t] 


WS1=WB0*W58 


09 


wo9=woi*wob 


W50=W58 J> W£7 


10 


W10=W02"WQ9 


W49^W57^W56 


11 


WU=YWTWIO 


W48=^r5«*Sub BytcCRot Byte(W85)rRconC7] 


12 


W 2*W)4~Sub BvtcCW1 1 ) 


W47=W55*W54 


13 


Wl3=%TO5"*W12 


W4&=Yf54^V53 


14 


W14=VJ08^3 


WS=W53*W32 


16 


W15=V«07^W14 


W44«W32'5i* SytefllWI) 


16 


W16=V»rSub Byte<Roi ByteWt STRcanE] 


W43=W1"VW0 


17 


W17=W0B>16 


Vf42=W30^49 


18 


W18=W10TW17 


W41=W49~W4B 


19 


W19=W11*W1B 


W40=V¥48~S«b Byti^Rot Byla(W47>rRccn£fi3 


20 


W20=Wl2 - Sab BytoCW19) . i 


W39=W4T*W4B 


11 


W21=W3"W2Q 


TO7=W4B*W45 


22 


W22=W14>V21 


W3€=WrW44 j 


23 


W23=W13TW22 


Vy33^rt4^S«bByte(W43> i 




Owrftted 




52 


W52=W44~Sub BytKW51) 


W0T=W1S>/14 


53 


W53=W4TW52 


W06=W14 - W13 


34 


W54=W4B^W53 


YW*=W13* , W12 


33 


W35=W47"Ym 


W04=W1 2~Sub ByteOffl 1) 


56 


W5&=W4B"Sub BytrfRct BrtetWSBH* RcOnTTl 


msswii*wicr 


37 


W57=W4»'YfB6 




58 


W5S=W50~W57 


VW1=W0TW08 


59 


W5S=W51"Y/5B 


WOO^DTSub BytafRot Byta(VW)7)m:onCll 



1 



Initial 

Round Key 



KeyOI 



Round 



Round 
KovOl 



Round 
Kev04 



Round 
KayOS 



► Round 
Key13 



> Round 
Ksy14 



45 



[0107] The expanded key segment W08 for encryption, in accordance with the formula WOB^WOC^Sub Byte(Rot 
Byte(W07))*Rcon[1 ], is obtained by XORlng WOO, Sub Byte(Rot Byte(W07) and theconstant Rcon[1 ]. Because A A A= A , 
the expanded key segment Woo can be expressed as WO0=WO8 A Sub Byte(Rot ByteCWO^Rconll], meaning that 
WOO can be generated from W08 and W07. Thus, for decryption, first W00=>W59 are generated, and then In the order 
that is the inverse of encryption, i.e., W59=>WO0, expanded key segments are generated. In this manner, mere Is no 
need to store all the expanded keys for decryption in memory, making possible decryption processing wherein only 
the expanded key segments needed for each round are generated. 

[0108] An explanation will first be given of the generation of expanded key segments for the Round Key Addition 
function of the round function unit 

[0109] As shown in Table 1 0, in the Round Key Addition function In each round, 4 expanded key sepjnents having 
32 bits are used; because expanded key operations are performed m the background of the Mix Column transformation 
+ Round Key Addition function of the round function, 4 expanded key segments may be created In 4 cycles. For thrs 
reason, in a circuit constitution as shown in FIG. 9, 1 expanded key segment is generated in 1 cycle. The expanded 
key segment register 120 comprises a shift register, and the expanded key segments currently being used In a round 
function use the output W0KEY of the flip-flop 1 21 . 
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[0110] The selector 105 (SEL_B) of the expanded key generation logic unit 101 , as shown In Table 11 , is controlled 
eo as to switch depending upon 2 different types of conditions, namely, key length and encryption/decryption. The 
selectors 111, 112, and 113 (SEL^E through SEL_G), into which the output of the expanded key generation logic unit 
101 is inputted, are set based on key length, as shown In Table 12. However, when a cipher key is inputted as an initial 
value, V is selected as the selector position for the selectors 111 through 113. The selectors 107 and 109 (SEL_C, 
SELJ3), as shown in Table 13, are controlled so as to switch depending upon the expanded key generation Jogte. The 
ROM 1 02 stores the constant Rconp], which is inputted to the XOR circuit 104, and the constant Rcon[i] corresponding 
to the address T is stored as shown In Table 14. 
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SEL_E through SELJ3 Control 
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Rcon ROM Table 
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[Table 14J (continued) 



Rcon ROM Table 
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[0111] An explanation will be given of circuit operations when the key length is 256 bits, as shown In Table 1 0. Prior 
to operation of the round function, through the loading of the values of the registers kO through k7 of the key Input 
register 131 , the initial values from No. 00 through No. 07 are set in the flip-flops 121 through 12B of the expanded key 
register 120. 

[01121 The expanded key segment W0B for encryption is computed, as shown in Table 10, with the operation 
WOB=W0O*Sub Byte(Rot Byte(WG7))ARcon[1]. At the beginning of this operation W08=W0O*Sub Byte(Rot Byte(W07)) 
ARcon[1], WOO is set at the output WOKEY of the flip-flop 121 and Inputted into the XOR circuit 1 04. W07 Is set at the 
output W7Key of the flip-flop 128, and this W07 is inputted into the selector 105 (SEL_B). 

[0113] The Rcon address of the ROM 102 Is made "1" and the signal RCON_EN to be Inputted Into the AND circuit 
1 03 is enabled; the Rcon[1}*W00 operation Is performed by the XOR circuit 1 04, and the result thereof is Inputted into 
the XOR circuit 1 1 0. Meanwhile, W07i having passed through the selector 1 05 (SEU_B), Is processed by the Rot Byte 
circuit 106 and the Sub Byte circuit 1 08; the result of the Sub Byte(Rot Byte(W07)) operation is inputted into the XOR 
circuit 110. Thus the XOR circuit 110 performs the W08=WGO*Sub Byte<Rot Byte(W07))*Rcon[1] operation. 
[01 14J An explanation will next begiven of the expanded key segment W09^W01*W08 operation processing. At the 
beginning of the W09=W01 A W08 operation, W01 is set at the output WOKEY of the flip-flop 1 21 and then inputted Into 
the XOR circuit 1 04. W08 is set at the output W7KEY of the flip-flop 128, and Inputted Into the selector 1 05 (SEL^B). 
The signal RCOIM_EN to be inputted Into the AND circuit 1 03 is disabled, and W01 to be inputted from the flip-flop 1 21 
Is set so as to inputted Into the XOR circuit 110. At this time, the selector 109 (SEL_D) Is set at selector position "b", 
and W08, having passed through the selector 105 (SEL_B), is inputted into the XOR circuit 110. . 
[0115] Thus the XOR circuit 110 performs the WO9=WO1AW08 operation. The operations for W10, Wll and W13 
through W1 5 are perf ormed along the same path. 

[0116] The expanded key segment W12 operation processing will now be explained. The expanded key operation 
W1 2=W04 A Sub Byte(Wl 1 ) is performed; at the beginning of this operation, W04 is set at the output WOKEY of the flip- 
flop 1 21 , and inputted Into the XOR circuit 1 04. W1 1 is -set at the output W7KEY of the flip-flop 1 28. and inputted into 
the selector 105 (SEU_B). The signal RCON_EN to be inputted into the AND circuit 103 is disabled, and W04 Is set 
so as to be inputted into the XOR circuit 1 04. Meanwhile, the selector position of the selector 107 (SEl^C) Is set at 
^b", and W11 , having passed through the selector 1 05 (SEL_B). Is inputted into the Sub Byte circuit 1 0B via the selector 
107 (SEL_C). Thus the Sub Byte circuit 108 performs Sub. Byte processing, and the result of the Sub Byte{W11) 
operation is inputted Into the XOR circuit 110. Thus the XOR circuit 130 performs the W12=W04 A Sub Byte(Wll) op- 
eration. 

[01 17] In the above manner, operations for ail the expanded key segments are performed. 
[01 1 8] Next, an explanation will be made of the rewrite of the key input register 1 31 for encryption and the setup of 
the expanded key initial following completion of encryption and decryption. This setup operation is an operation In 
preparation for the subsequent encryption or decryption, in which an expanded key initial value Is transmitted to the 
expanded key register 1 20. 

[01 19] An expanded key initial value set at the key input register 131 undergoes 32-blt unit data selection by the 
selector 132 (SEU.A), and is set at the expanded key register 120 via the selection position "b" of the selector 113 
(SEU.G): The expanded key register 120 is constituted as the shift register described above, shitting, data along the 
direction of flip-flop 128 (FF7) => flp-flop 127 (FF8) => flip-flop 128 (FF5) => flip-flop 125 (FF4) => flip-flop 124 (FF3) 
=> f Bp-flop 123 (FF2) flip-flop 122 (FF1) flip-flop 121 (FF0), transmitting all the expanded key initial values in 8 
cycles. The key Input data to be selected by the selector 132 (SELJ\) is in the order of the registers kO, k1.k2.k3, k4, 
k5, k6, k7 of the key input register 131. 

[01 20] An explanation will be given of expanded key Initial value setup following the rewrite of the key Input register 
131 for decryption. As shown In Table 10, In decryption, the expanded key Initial value must be made the final expanded 
key segment set during encryption, namely W59 through W52. Through the rewrite of the key input register 1 31 , the 
data that Is set at the key input register 131 Is, In the manner described above, first transmitted to the expanded key 
register 120, and in accordance with the expanded key generation logic for encryption, the circuit of FIG. 9 is caused 
to operate up through the final expanded key segment set, namely W52 through W59. 

[0121] As this final expanded key segment set Is being generated, during generation of W52, W52 is transmitted to 
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the register k7 of the key input register 131 ; during generation of W53, W53 is transmitted to the register k6; during 
generation of W54. W54 is transmitted to the register k5; during generation of W55, W55 Is transmitted to the register 
k4; during generation of W56, W56 is transmitted to the register k3; during generation of W57, W57 Is transmitted to 
the register k2; during generation of W58, W58 is transmitted to the register k1; during generation of W59, W59 is 
s transmitted to the register k0; thus the final expanded key segment is set in the reverse order in the key input register 
131 . Moreover, by transmitting the final expanded key segment set of the key Input register 131 to the expanded key 
register 1 20 In the manner described above, the setup of the expanded key initial value following the rewrite of the key 
input register during decryption is completed. 

[0122] Thereafter, the selector 1 05 (SEL^B), selector 1 07 (SEL_C), selector 109 (SEL_D), and selectors 111 through 
ft> 113 (SEL_E through SEkJa) are set at selector positions as shown In Tables 11 through 13, and the expanded key 
segments needed for decryption are generated In order. Shared Use of the Byte Sub Transformation Circuit 
[0123] Because the above-described Sub Byte processing of the key schedule unit and Byte Sub transformation 
processing of the round function unfc both execute Byte Sub transformation processing In 32-bft units, a single circuit 
can be used for both these processings. 
is [0124] For example, let us consider using the Byte Sub circuit 108 provided in the key schedule unit shown in FIG. 
9 as the Byte Sub transformation circuit of the round function unit. 

[0125] The input 8SIN Into the Byte Sub circuit 207 from the intermediate register/Shift Row transformation circuit 
206 In the round function unit shown In FIG. 4 connects with selector position "c" of the selector 107 of the expanded 
key generation logic unit 1 01 shown In FIG. 9. The output from the Sub Byte circuit 1 08 of the expanded key generation 
so logic unit 1 01 connects to the selector 203 as the output BSOUT of the Byte Sub transformation circuit 207 of FIG. 4. 
[0125] When using the Sub Byte circuit 108 to perform Byte Sub transformation processing, as shown In Table 13, 
with the selector position of the selector 107 <SEL_C) at "c", the selector position of the selector 1 09 (SELJD) is set 
at tt b*. In this manner, the Sub Byte circuit 108 of the expanded key generation logic unit 101 can be used to execute 
the Byte Sub transformation processing of the round function unit. Byte Sub Transformation Circuit Byte Sub transfor- 
mation processing is a combination of an inverse operation in 8-bit unfts and a matrix operation; for encryption, after 
the performance of an inverse operation, a matrix operation is performed; for decryption, after the performance, of a 
matrix operation, an inverse operation Is performed. In order to implement such Byte Sub transformation processing 
using a common circuit for both encryption and decryption, a circuit as shown in FIG. 10 Is hereby proposed. 
[01 27] A Byte Sub transformation circuit 391 as shown In FIG. 10 comprises a matrix operation circuit for decryption 
392, a selector 393, an inverse operation circuit 394. a matrix operation for encryption 395, and a selector 396. 
[0128] The selector 393 is constituted so that Input data and the output of the Inverse operation circuit 392 are 
inputted therein, of which one is inputted to the Inverse operation circuit 394. The selector 396 is constituted so that 
the output of the inverse operation circuit 394 and the output of the matrix operation for the encryption circuit 395 is 
Inputted therein, of which one Is outputted. 
as [0129] During encryption, me selector 393 Is on the Input data side, and the selector 396 is on the matrix operation 
for encryption 395 side. During decryption.the selector 393 is on the matrix operation for decryption 392 side, and the 
selector 39B is on the inverse operation circuit 394 side. In this manner, Byte Sub transformation processing for en- 
cryption and Byte Sub transformation processing for decryption can be accomplished using a common circuit consti- 
tution. 

40 [0130] The matrix operation Tor encryption can be expressed as the following expression 1 . 
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[0131] As this is expanded, it can be expressed as the following expression 2. The V below means ah XOR oper- 
ation.- 
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[0132] The matrix operation for decryption can be expressed as the following expression 3. 
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[Expression 3] 
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[0133] As this le similarly expanded, It can be expressed as the following expression 4. 
(Expression 4] 



y 0 = X 2 

Vi = x 0 

y 2 = x, 

y 3 = Xo + x 2 



+ x s + X, + 1 

x a + x e 

+ x 4 . + x 7 + 1 



y 4 = 

y 5 = 

y e = Xo 

y 7 = 



+ x 3 
Xa + x 4 



* x, 



6 



+ X, 



+ X 4 - , • + X 6 



[01 34] An example of a matrix operation circuit for encryption Is shown in FIG. 11. 

[0135] This circuit comprises an 8-bit input register 401 , an output register 403, and a logic circuit 402 comprising 
so XOR and NOT gates. The execution of the XOR operation shown In expression 2 for encryption can be achieved 
through 1 6 XOR gates and 4 NOT gates by having XOR circuits in the logic circuit 402 share overlapping operations. 
[0136] An example of a matrix operation circuit for decryption is shown in FIG. 12. 

[0137] Simllarto thematrlxoperattonclrcultfor encryption, this circuft comprises an 8-bit input register 405, an output 
register 407 and a logic circuit 406 comprising XOR and NOT gates. As with tfce matrix operation circuit Tor encryption, 
ss the execution of the XOR operation shown in expression 2 for encryption can be achieved through 1 3 XOR gates and 
2 NOT gates by having XOR circuits In the logic circuit 406 share overlapping operations. 
[0138] Another example of a matrix operation circuit for encryption \s shown in FIG. 13. 

[0139] This matrix operation circuit for encryption comprises an input register 411. an output register 414, a shift 
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register for holding constants 41 3, and a logic circuit 41 2 comprising XOR circuits. The input register 41 1 , output register 
414 and a register for holding constants 413 are all 8-blt shift registers that are synchronized with a clock to make 
cyclic shifts 1 bit to the right. 

[0140] The constants In the first right column of expression 1 are constituted so that each line has 3 0*8 and 5 1's 

5 and shifts 1 bit at a time. Then, as bits xO, x4, x5, x6, x7 of the Input register 411 are cyclically shifted, they are Inputted 
into the logic circuit 41 2 and XORed; thus the matrix operation of the first right column of expression 1 is performed. 
(01 41J The constants in the second column from the right in expression 1 are set in the register for holding constants 
413 , starting from the lower bits. As the values of the register for holding constants 41 3 are cyclically shifted, the values 
of the lowest-order bits are inputted Into the logic circuit 412 and XOR operations are performed; thus the matrix 

10 operation of the second column from the right of expression 1 is performed. 

10142] When data Is set at the input register 411 in this manner, at the first clock cycle operations are performed on 
yO, and the result is then stored In the output reglster414. At the next clock cycle operations are performed on y1 , and 
the result Is then stored in the output register 414. Operations are then performed in order so that with 6 clock cycles 
the operations on (y7, y6, y5, y4, y3 y2 , y 1 yO) are completed. The logic circult41 2 can in this case execute the operation 

15 processing of expression 2 using 5 XOR circuits. 

[0143] An example of another matrix operation circuit for decryption, with a similar constitution, Is shown in FIG. 14. 
[0144] This matrix operation circuit for decryption comprises an input register 415, an output register 41 8, a register 
for holding constants 417 and a logic circuit 416 comprising XOR circuits. The input register 415, output register 41 8, 
and registerfor holding constants 417 are all 8-bit shift registers that are synchronized with a clock to meJcecydfc shifts 

so i bit to the right. 

[0145] The constants in the first-right column of expression 3 are constituted so that each line has 3 0's and 5 1's 
and shifts 1 bit at a time. Then, as bits x2, x5, x7 of the input register 415 are cyclically shifted, they are inputted Into 
the logic circuit 416 and XORed; thus the matrix operation of the first right column of expression 3 is performed. 
[0146] The constants In the second column from the right in expression 3 are set In the registerfor holding constants 
417, starting from the lower bits. As the values of the registerfor holding constants 417 are cyclically shifted, the value 
of the lowest-order bits is inputted into the logiecircuft41 6 and XOR operations are performed; thus the matrix operation 
of the second column from the right of expression 3 is performed. * 
[01 47] When data Is set at the input register 41 5 in this manner, at the first clock cycle, operations are performed on 
yO, and the result Is then stored in the output register 41 8. Operations are then performed in order so that with 8 clock 
30 cycles the operations on (y7, y6, y5, y4, y3 y2, yl yO) are completed. The logic circuit 41 8 can in this case execute the 
operation processing of expression 4 using 3 XOR circuits. 

[0148] The use of the present hvention enables the implementation of the AES block cipher algorithm in a compact 
circuit through the division of data to be processed by specified circuits into predetermined execution block lengths. 
Abo, through the sharing of cfrcuits for processing for encryption as circuits for processing for decryption, as well as 

35 . the sharing of some circuits by key schedule unit and the round function unit, the scale of circuit can be further reduced. 
[0149] While only selected embodiments have been chosen to illustrate the present invention, to those skilled in the 
art It will be apparent from this disclosure that various changes and modifications can be made herein without departing 
from the scope of the Invention as defined in the appended claims. Furthermore, the foregoing description of the em- 
bodiments according to the present Invention is provided for Illustration only, and not for the purpose of limiting the 

40 Invention as defined by the appended claims and meJr equivalents. 

Claims 



25 



45 



! . An encryption circuit that generates from a cipher key a plurality of round keys having a number of bits correspond- 
ing to a predetermined processing block length and executing, for each processing block length, input data and 
round key encryption/decryption processing, by means of a round function unit comprising an XOR operation unit 
that XORs the input data and one of the round keys and a round processing unit that iterates round processing 
that Includes Byte Sub transformation, Shift Row transformation, Mix Column transformation and Round Key Ad- 
so ditlon, wherein: 

said round processing unit comprises: 

a first selector that segments input data into execution block lengths smaller than said processing block 
55 length; a first Round Key Addition circuit that adds said round key value to input data for each.sald exe- 

cution block length; 

an intermediate register/Shift Row transformation circuit that temporarily stores the output of said first 
Round Key Addition circuit and executes Shift Row transformation using said processing block length; 
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a Byte Sub transformation circuit wherein 

said Intermediate reglster/Shfft Row transformation circuit value is inputted for each said execution block 
length and Byte Sub transformation is executed; a second Round Key Addition circuit wherein 
said Intermediate registeryshift Row transformation circuit value Is inputted for each said execution block 
s length and said round key value is added for each said execution block length; 

a Mix Column transformation circuit executing Mix Column transformation on the output of said second 
Round Key Addition circuit; and 

a second selector that outputs to said first Round Key Addition circuit one output from among the outputs 
of said first selector, intermediate register/Shift Row transformation circuit, Byte Sub transformation circuit, 
10 or Mix Column transformation circuit. 

2. An encryption circuit according to claim 1 wherein said execution block length is a multiple of 8 bits. 

3. An encryption circuit according to claim 1 , wherein said processing block length Is 128 bits and said execution 
is block length Is 32 bits. 

4. An encryption circuit according to claim 1 , wherein the key length of the cipher key is any of 128 bits, 1 92 bits or 
256 bits. 

20 5. An encryption circuit according to claim 1 , wherein: 

said Byte Sub transformation circuit comprises a matrix operation unit for decryption that executes a matrix 
operation on input data; 

a third selector that outputs either the input data or the output of said matrix operation unit for decryption; 
25 an inverse operation unit for executing an inverse operation on tfre data outputted from said third selector; a 

matrix operation unit for encryption that executes a matrix operation on the data outputted from said inverse 
operation unit; and a fourth selectorthat outputs eitherthe output of said inverse operation unit or. the output 
of said matrix operation unit for encryption. 

30 6# * n encr yptlon circuit according to daim 5;. wherein said matrix operation unit for decryption and said matrix oper- 
-• ationunJt for encryption comprises an XOR circuit so as to perform 8-bit operations atone dock cycle. 

An encryption circuit according to claim 5, wherein said matrix operation unit for decryption and said matrix oper- 
ation unit for encryption comprises an XOR circuit so as to perform 1 -bit operations at one clock cycle. 

Ah encryption circuit according to claim 1 , wherein said intermediate register/Shift Row transformation circuit can 
be used for both encryption and decryption through the reversal of order of Input of shift data relating to amount 
of shift for data to be inputted into said intermediate register/Shift Row transformation circuit, the Input order for 
decryption being the reverse of the order for encryption. 

An encryption circuit according to claim 1 , wherein said Mix Column. transformation circuit comprises a plurality of 
multiplication units with unique multipliers and an XOR circuit that performs XOR operations for said plurality of 
multiplication units, said Mix Column transformation circuit executing a matrix operation between data Inputted 
into each multiplication unit and the multiplier established for each multiplication unit 

10. An encryption circuit according to claim 9, wherein said Mix Column transformation circuit comprises 4 operation 
units having 4 multiplication units capable of 8-blt unit operations and XOR circuits that execute XOR operations 
based on the outputs of said 4 multiplication units. 

so 11 . An encryption circuit according to daim 9, wherein said multiplication units can control 2 multipliers and are used 
for both encryption and decryption. 

12. An encryption drcuit according to claim 11, wherein said multiplication units are constituted to control addition 
values from high-order bits. 

55 

13. An encryption drcuit according to claim 1 having a key expansion schedule drcuit that generates from said cipher 
key, as an expanded key segmented into bit numbers corresponding to said execution block length, a plurality of 
round keys with bit numbers corresponding to a predetermined processing block length; the key expansion sched- 
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ule drcutt comprising: 

a fifth selector that segments a cipher key Into the number of bfts corresponding to said execution block length 
and outputs the same; 

a shift register to which flip-flop circuits are connected at a plurality of stages, said flip-flop circuits latching 
data in unfts of said execution block length; 

a first XOR circuit that XORs the output of the final stage flip-flop circuit of said shift register with one constant 
selected from among a group of constants; 

a sixth selector into which are Inputted the outputs of those flip-flops of said shift register that are involved in 
operations for encryption and the outputs of those flip-flops involved In operations for decryption, and which 
selectively outputs one of these; 

a Rot Byte processing circuit that rotates the output of said sixth selector; 

a seventh selector into which the output of said sixth selector and the output of said Rot Byte circuit is inputted 
and which selectively outputs one of these; 

a Sub Byte processing circuit that executes Byte Sub transformation on the output of said seventh selector 
for each said execution block length; 

an eighth selector Into which the output of said sixth selector and the output of said Sub Byte processing circuit 
are Inputted, and which selectively outputs one of these; 

a second XOR circuit that executes an XOR operation based on the output of said first XOR circuit and the 
output of said eighth selector ; and 

a shift register unit selector that selectively outputs, to those flip-flops of said shift register the outputs of which 
are subject to operations for encryption, either the output of said second XOR circuit or the output of the 
adjacent stage flip-flop. 

14. An encryption circuit according to claim 1 3, wherein said shift register comprises 8 flip-flops executing data process- 
ing in 32-brt units, and said sixth selector Is constituted so that the outputs of the second, fourth, sixth and eighth 
flip-flops from the bottom from among said flip-flops are Inputted therein, and that it outputs one of these. 

15. An encryption circuit according to claim 13, wherein through the Input into said seventh selector of the output of 
said intermediate register/Shift Row transformation circuit and the input Into said second selector of the output of 
said Sub Byte processing circuit, a single circuit can be used for said Sub Byte processing circuit and said Byte 
Sub transformation circuit of said round processing unit. 
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Kay Length == t28bit or 192bit 

KeyExpansIon ( byte Key [4*NlO word W [Nb * ( Nr + 1 ) ] 



for(i = 0:i<Nk:rH-) 

W [I 3 = (Key [4*1]. Key [ 4*i + 13 . Key [ 4 * 1 + 3 3 ) ; 
for(i = IMk;i<Nb*(Mr+1 );i4+) 

{ . ■ 

temp = W [ i - 1 ] ; 
if(..SNk = = 0) 

temp = Sub Byte ( Rot Byte ( temp ) ) * Rcon [ i / Nk] 
WU] = WCi-Nk]-tomp; 



Key Length = = 256bit 

KeyExpansIon (bytei Key [ 4 * Nltf word W [ Nb * < Nr + 1 ) ] 



for(l = 0;i<Nk;H+) , ' 

W[j] = (KeyC4*i].Key[4*i + 1].Key[4*l + 33): 
for < 1 = Nk : l< Nb *( Nr + 1 ) ; H+) 

! " 
temp=W[i-1 3; 
if (i%Nk = = 0) 

temp = Sub Byte ( Rot Byte ( temp ) ) " Rcon [ i / Nk ] ; 
elseSf(.i»Nk* = 4) 



temp = Sub Byte ( temp ) ; 
W[i]=W[i-Nk3 A temp; 



Fig . 2 
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(54) AES Encryption circuit 

(57) A round processing unit in an encryption circuit 
cprriprises: a first Round Key Addition circuit (204) that 
adds a round key value to Input data; an Intermediate 
register/Shift Row transformation circuit (206) that tem- 
porarily stores the output of the first Round Key Addition 
circuit (204) and executes Shift Row transformation; a 
Byte Sub transformation circuit (207) into which the val- 
ues of the intermediate register/Shift Row transforma- 
tion circuit (206) are inputted and which executes Byte 
Sub transformation; a second Round Key Addition cir- 
cuit (208) into which the values of the Intermediate reg- 
ister/Shift Row transformation circuit (206) am Inputted 
and which adds round key values; a Mix Column trans- 
formation circuit (210) that executes Mix Column trans- 
formation upon the outputs of the Second Round Key 
Addition circuit (208); and a second selector (203) that 
outputs to the second Round Key Addition circuit (204) 
one of the outputs of a first selector (202}, the Interme- 
diate register/Shift Row transformation circuit (206), the 
Byte Sub transformation circuit (207), and the Mix Col- 
umn transformation circuit (2 1 0). Such an encryption cir- 
cuit reduces a scale of circuit and can achieve a certain 
level of high-speed processing in the implementation of 
the AES block cipher. 
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